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^^ ' Abstract 

(N 

%^ , Fifty years ago Erdos asked to determine the minimum number of fc-cliques in a graph on n 

vertices "with independence number less than I. He conjectured that this minimum is achieved by 
the disjoint union oi I — 1 complete graphs of size j^j. This conjecture "was disproved by Nikiforov 
ff^ , "who sho-wed that the balanced blo"w-up of a 5-cycle has fe"wer 4-cliques than the union of 2 complete 

graphs of size ^ . 

In this paper we solve Erdos' problem for {k,l) — (3,4) and {k,l) — (4,3). Using stability 
^^ , arguments "we also characterize the precise structure of extremal examples, confirming Erdos' 

r) ' conjecture for (fc, I) = (3, 4) and sho"wing that a blo'w-up of a 5-cycle gives the minimum for 

^•' (fc,0 = (4,3). 

-)— » ' 
2 ■ 

1 Introduction 

. Let Ki denote a complete graph on I vertices and let Ki be its complement, i.e., an independent set 

ff^ . of size I. One of the central results in extremal combinatorics is Turan's theorem [15], "which asserts 

^N I that the maximum number of edges in a Ki-bee graph on n vertices is attained by the Turan graph 

^\| I r„ /_!, a complete (/ — l)-partite graph "with nearly-equal parts. This theorem has since been extended 

QT) [ and generalized in many different "ways. Since an edge can be thought of as a clique on 2 vertices, 

^^ I a natural generalization is to ask for the maximum number of K^ in an n- vertex graph with no Ki. 

Zykov [16] showed that this maximum was also attained by the Turan graph Tn^i-i. 

For any integers k,l > 2 and n, we define /(n, k, I) to be the minimum number of copies of K^ in 

a i^;-free graph on n vertices. If one takes the complements of the graphs in Turan's theorem, then 

?-H ■ the theorem gives the minimum number of edges in an n-vertex i^;-free graph. Thus the question of 

C^ « 
- - determining f{n,k,l) is precisely the Zykov-type generalization of this complementary version. Fifty 

years ago Erdos [1] asked to determine f{n,k,l) and conjectured that the minimum is given by the 

complement of the Turan graph, T„ /_i, which is the disjoint union of I — 1 complete graphs of equal 

size. When k = 2, this follows from Turan's theorem. 

Note that a graph is i^z-free precisely when its independence number is less than /. One can thus 

also view this problem as a strengthening of Ramsey's theorem, which states that any sufficiently 
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large graph either has a chque of size k or an independent set of size I. The (fc, /)-problem asks how 
many chques of size k a graph must have when its independence number is less than /. 

Lorden [8] proved Erdos' conjecture to be true for the (3, 3)-case by a simple double-counting 
argument. However, no further progress was made in the next forty years, until Nikiforov ^ disproved 
the conjecture in the case (4,3) by showing the balanced blow-up of C5, which is i^s-free, contains 
fewer 4-cliques than the disjoint union of two cliques, r„^2- In a blow-up of a graph, we replace 
every vertex with a clique, and every edge with a complete bipartite graph. We say the blow-up is 
balanced if the cliques are all of the same size. In a subsequent preprint [10], Nikiforov showed that 
his construction is optimal under the additional assumption that the graph should be nearly-regular. 

Moreover, by considering blow-ups of Ramsey graphs, Nikiforov showed that the conjecture could 
only hold for finitely many (k, I) when k,l > 3. In particular, he conjectured that equality held only 
for the cases (3, 3) and (3, 4), the latter of which remained an open problem. 

1.1 Our results 

In this paper, we first sharpen Nikiforov's result by showing that Erdos' conjecture is always false 
when k > A and / > 3, or when k = 2> and / > 2074. We obtain these results through a combination 
of explicit and random counterexamples. 

We then solve the problem in the cases (A:,/) = (4,3) and (3,4). Using the machinery of flag 
algebras developed by Razborov [T2], we are able to obtain the asymptotic values of /(n, 4, 3) and 
/(n, 3,4). By analyzing the corresponding semi-definite programming solutions, we are then able to 
derive stability results for these cases, which in turn allow us to determine /(n,4, 3) and /(n,3, 4) 
exactly for large n, and also to characterize the extremal graphs. In particular, we show that a 
blow-up of C5 is indeed optimal for the (4, 3) problem, while Erdos' conjecture holds for the (3, 4) 
problem. Our results are summarized in the following theorems. 

Theorem 1.1. /(n,4, 3) = 25(4) + O(n^), where the minimum is achieved by a blow-up of C^ with 
five parts of roughly equal sizes. Moreover, the extremal structure is unique for sufficiently large n. 

We determine the exact sizes of the parts of the blow-up by solving an integer optimization problem, 
the precise results of which are given in Section [H 

Theorem 1.2. /(n,3,4) = (L^/^J) _^ (|L(n+^i)/3j^ _^ ^l{n+2)/3]^^ ^ i^^ ^^^^^ j^^ ^^^^^ ^ ^^^^ minimum 
is achieved by three disjoint cliques that are as equal as possible. Moreover, any extremal graph must 
be spanned by three such cliques. 

Note that in this case the extremal graph is not unique, as we may have partial matchings between 
the cliques without introducing any extra triangles. 

As we remark in our concluding section, solutions of corresponding SDP problems strongly sug- 
gest that a disjoint union of cliques remains optimal for the (3, 5)- and (3, 6)-problems, contrary to 
Nikiforov's conjecture. 



1.2 Notation and organization 

Given a graph G on vertices V{G)^ and a vertex v G V{G), we denote by N{v) the set of neighbors of 
V in G, and by N(v) the set of non-neighbors of v. The complement graph G shares the same vertices 
as G, and has an edge {u, v} if and only if {«, v} is not an edge of G. We denote the independence 
number of G by a{G). The complete graph on k vertices is denoted by K^. In particular, a graph 
G is Ki-iree if and only if a{G) < I. Some other graphs we will use are the cycles Gk, and paths Pk 
where in each case the subscript refers to the number of edges. 

Given a fixed graph H, for any graph G we let tniG) denoted the number of induced copies of H 
in G. In the case H = K/., we simplify the notation to tk{G). Using this notation, we can define 

f{nXl) = min{tfc(G) : \V{G)\ = n,ti(G) = 0}. 

The rest of the paper is organized as follows. In the next section, we construct counterexamples 
to Erdos' conjecture in the case k > 4 and / > 3 or A; = 3 and / large. In Section [3l we provide an 
informal introduction to our main tool, fiag algebras. Sections U] and [5] contain the proofs of our main 
results for the (4, 3)- and (3, 4)-problems respectively. The final section contains some concluding 
remarks and open problems. 

Some technical details are given in the appendices: Appendix [A] provides some remarks regarding 
implementation of flag algebras, and Appendix[B] contains the proof of the integer optimization result 
for the (4, 3)-problem. 

2 Counterexamples to Erdos' conjecture 

Nikiforov ^ showed that not only was Erdos' conjecture not true in general, but that it held only 
finitely often. He used bounds on the Ramsey numbers -R(3, /) to show the existence of ko and Iq 
such that whenever k > kQ or I > Iq, blow-ups of Ramsey graphs did better than disjoint unions of 
cliques Tn^i-i- In the following theorem, we use a combination of explicit and random constructions 
to further improve this result. 



Theorem 2.1. T„^/_i is not optimal for the (k,l)-problem when 
(i) k > 4 and I > 3, or 
(ii) k = 3 andl> 2074. 

2.1 The {k, /)-problem with k>4 

Let us first consider the case I = 3. That is, we are looking to minimize the number of fc-cliques in 
a graph with independence number at most 2. For the (4, 3)-problem, Nikiforov |10) gave an explicit 
counter-example to Erdos's conjecture by showing that a blow-up of C5 contains fewer triangles than 
the graph Tn,2, which consists of two disjoint cliques. In fact, it is easy to see that this construction 
is better than Tn^2 for any k > 4. Indeed, a disjoint union of two cliques contains, asymptotically, 
2(1) ~ 2^(fc) ^-cliques. On the other hand, the blow-up of C5 contains 5 ( ( | ) — (|) j ~ 'Ik-l {1) 



fc-cliques. For A; > 4, we have 4^^ < p^") a-i^d so T„^2 is asymptotically not optimal for the {k,3)- 
problem. 



For / > 4, the graph T„ ;_i consists of / — 1 disjoint cliques. However, as shown above, if we 
replace two of these cliques with a blow-up of C^ on the same number of vertices, we will reduce the 
number of /c-cliques. Formally, this construction has a blow-up of C5 on five parts of size ctjztt' ^^'^ 
I — 3 disjoint cliques of size j^, and contains fewer fc-cliques than T„^;_i. This shows that a disjoint 
union of cliques is not optimal for the (fc, Z)-problem for any k> A and I > 3. 

2.2 The (3, /)-problem 

The situation is quite different when k = 3. As we will show later, the disjoint union of cliques is 
optimal for the (3, 3)- and (3, 4)-problems. However, unlike the case k = 2, this construction ceases to 
be optimal for large values of I. We consider the random graph G ~ G{m,p) on m vertices, with every 
edge appearing independently with probability p. For suitable parameters l,m, and p, we show that 
with positive probability the balanced blow-up of G has no independent set of size I and has fewer 
triangles than Tn^i^i- First we count the number of triangles in a balanced blow-up of an m- vertex 
graph G to n vertices. 

There are three ways to obtain a triangle in the blow-up. The vertices of the triangle can all come 
from one part, in which case there are — vertices to choose from. As there are m vertices in G, there 
are 7ti,(™) ~ T^is) such triangles. Alternatively, the vertices of the triangle can come from an edge in 
G, with two vertices from one part, and the third vertex from the other. There are two ways to split 
the vertices, and e(G) edges, so the total number of such triangles is 2e(G) (^) ( Y ) '^ ^3 (3) . Finally, 
the vertices of the triangle can come from a triangle in G, with one vertex from each of the three 
parts. There are t^^G) triangles in G, and so the number of such triangles is t^iG) (^) -^ ^ 3 ' (g) . 
Thus the total number of triangles in the blow-up of G is asymptotically ( 3^ H j I (T). 

On the other hand, Tn^i-i has {I — l)('-i) ^ n\\i (3) triangles. Thus to obtain a counter- 
example to Erdos's conjecture, we need to show that for some l,m and p, with positive probability 
the random graph G ~ G{m,p) has no independent set of size / and ^^^ ' 3^^ '' -\ j < tttttW) ot 

3 

e{G) + tslG) < Q(l^iY2 — ^- Let us call such a graph 'suitable'. 

Let Bi be the event that a{G) > /, where a{G) is the independence number of G. For some 
parameters s and t, let B2 be the event {e(G) — E[e(G)] > s}, and B3 the event {t3(G) — E[t3(G)] > t}. 
If E[e(G) + t3(G)] + s + t< ^^ - f , then {e(G) + h{G) > ^0^ - f } C i?2 U B^. Then we 
have 

F{G not suitable) < ¥{Bi UB2U B3) < ¥{Bi) + ¥{B2 U -B3). 

We use a union bound for Bi: there are (™) sets of I vertices, and the probability that a given 
set has no edges is (1 — p)V2J. Using the bound (") < (^) , we have 

Note that the other two events are increasing; that is, they are preserved by the addition of edges. 



It then follows from Kleitman's Lemma (see Chapter 6 in [T]) that P(i?2 H B^) > F{B2)P{B-i), and so 

p(S2Ui?3) = nB2)+nB3)-¥{B2nB3) < F{B2)+nB3)-nB2mB3) = p(fi2)+p(S3) (1 - nB2)) . 

Moreover, since the right-hand side is increasing in both P(i?2) and P(i?3), we can replace the proba- 
bilities with upper bounds to obtain an upper bound on P(i?2 U B3). To obtain these upper bounds, 
we use the following second moment concentration inequality from |Tj: 

Proposition 2.2. Let X be a random variable with expectation E,[X] = fj, and variance a'^ . Then for 
all A > 0, 



¥{X - /i > A) < 



a^ 



A2 + a2 • 

For the event B2, with X = e(G), we have X ~ Bin((^),p), and so /i = (™)p and o"^ 

)p(i-p) 



(™)p(l -p). This gives P(i?2) < ,.^(..)^(,_p) - 

For the event -B3, let X = t^{G). There are (T) possible triangles, each of which appears with 
probability p^, and hence ^ = ('?)p^- To find the variance, we note that any fixed triangle T is 
independent of all triangles except those that share at least two vertices with T. A quick calculation 

gives .' = Op' [(1 - p3) + 3(™ - 3)/(l - rt] . Thus rlB,,) < .'^^^i-C^^^^^^^^I'olr 

m^ m 

6(1-1)''' 6 



Thus if we can find /, m,p, s and t such that Cli)p + ('?)p^ + s + t < ^,™,w — ^, and 



I 






(T>(i-P) 



<1, 



then we prove that there is a suitable graph, and therefore T^^i-i is not optimal for the (3, /)-problem. 
A computer search determined that I = 2074, m = 164397, p = 0.0051707, s = 14000 and 
t = 35000 are suitable values. Hence the graph with 2073 disjoint cliques is not optimal for the 
(3, 2074)-problem. Moreover, if / > 2074, then in T^^i-i we can replace 2073 cliques by a graph with 
fewer triangles. Hence T„ ;_i is not optimal for the (3, /)-problem for any I > 2074. 



It would be interesting to find better constructions and to determine when T^^i-i stops being 
optimal for the (3, /)-problem. Our flag algebra calculations suggest that it is still optimal for at least 
the (3, 5)- and (3, 6)-problems. 

3 Flag algebra calculus 

In this section we provide a brief introduction to the technique of flag algebras. First introduced by 
Razborov in [12j, it has been applied with great success to a wide variety of problems in extremal 
combinatorics (see, for example, [3l [Sj l6l [TT t 113 1 [H]). 

We will begin with a general overview of the calculus, by introducing some key definitions and 
providing some intuition behind the machinery. The second subsection will show how we express 
extremal problems in the language of flag algebras. In Appendix [X] we discuss some practical consid- 
erations regarding implementation of the method, to explain how we obtained our results in the later 
sections. 



It is neither our goal to be rigorous nor thorough, but rather to emphasize that the combinatorial 
arguments behind the flag algebra calculus are as old as extremal combinatorics itself. Indeed, the 
main tools available to us are double-counting and the Cauchy-Schwarz inequality. To highlight this 
fact, we will use the (3, 3)-problem as a running example, and indeed, the proof we obtain through 
flag algebras will be essentially the same as the original proof Lorden gave in 1962. 

The flag algebra calculus is powerful because it provides a formalism through which the problem 
of finding relations between subgraph densities can be reduced to a semi-definite programming (SDP) 
problem. This in turn enables the use of computers to find solutions, with rigorous proofs, to problems 
in extremal combinatorics. For a more complete survey of the technique, we refer you to the excellent 
expositions in [7j and [11], while for a technical specification of fiag algebras, we refer you to the 
original paper of Razborov |12j . 



3.1 Basic definitions and notation 

The flag algebra calculus is typically used to flnd the extremal density of some flxed subgraph J 
amongst graphs that avoid some forbidden subgraph. For our example, the (3, 3)-problem, we wish 
to minimize the density of triangles K^ in graphs that do not contain K^, the empty graph on 3 
vertices. While our definitions will be general, all our examples will come from this setting. 

We say that a graph is admissible if it contains no induced copies of the forbidden graph. A type 
a is an admissible labeled graph on vertices [k] for some non-negative integer k called the size of a, 
denoted by \a\. In what follows, an isomorphism between graphs must preserve any labels that are 
present. 

Given a type o", a cr-flag is an admissible graph F on a partially labeled vertex set, such that the 
subgraph induced by the labeled vertices is isomorphic to a. The underlying graph of the flag F is 
the graph F with all labels removed. The size of a flag is the number of vertices. Note that when a 
is the trivial type of size (denoted by a = 0), a cr-flag is just an usual unlabeled admissible graph. 
We shall write Tf for the collection of all cr-flags of size /. Let T'^ = U/>o-^r- When the type a is 
trivial, we shall omit the superscript from our notation. 

Let us now define two fundamental concepts in our calculus, namely those of flag densities in larger 
flags and graphs. Let o" be a type of size k, let m > 1 be an integer and let {Fi}^^ be a collection of 
a-flags of sizes k = \Fi\ > k. Given a cr-flag F of order at least I = k + Yl^ii^i — k), let T CI V{F) 
be the set of labeled vertices of F. Now select disjoint subsets Xi C V{F) \ T of sizes \Xi\ = li — k, 
uniformly at random. This is possible because F has at least ^^{li — k) unlabeled vertices. Denote 
by Ei the event that the c-flag induced by T U Xj is isomorphic to -Fj, for i G [m]. We define 

def 

Pa-{Fi,F2, . . . , Fm] F) = F{ri^^Ei) to be the probability that all these events occur simultaneously. 
If G is just an admissible graph of order at least /, and not a cr-fiag, then there is no pre-labeled 
set of vertices T that induces the type a. Instead, we uniformly at random select a partial labeling 
L : [fc] — 7- V{G). This random labeling turns G into a cr'-flag Fl, where the type a' is the labeled 
subgraph induced by the set of vertices L([A;]). If a' = a, we can then proceed as above, otherwise we 
say the events Ei have probability 0. Finally, we average over all possible random labelings. Formally, 



let Y be the following random variable 

de_f J Pa{Fl,F2,.. 



Y 







otherwise 



def 



Define da{Fi, . . . ,Fm; G) = E(y) as the expected value of the random variable Y . The quantities 
PaiFi, F2, . . . , Fm] F) and do-{Fi, F2, . . . , Fm] G) are called flag densities of {-Fi}ig[m] in F and in G, 
respectively. Clearly these flag densities are the same whenever cr = 0, in which case we omit the 
subscript from both notations. 

To better illustrate these definitions, we give some examples. Let dot be the only type of size one. 
Let p and p be the two dot-Hags of size two, and let Zi, for 1 < i < 5, be the five admissible dot-Rags 
of size three (recall that we are forbidding K^). These flags are shown in Figured! 



1« 

dot 



lo- 



1® 





1 

Z2 Z^ Z4 Z5 

Figure 1: Some examples of flags of type dot. 



We now compute the flag densities of p and p in the flags Zi. For example, to compute Pdot^'-, ^i), 
note that to induce a copy of p we must choose an unlabeled non- neighbor of 1. As only one of 
the two unlabeled vertices in Zi is a non- neighbor of 1, we conclude that Pdotip', Z\) = i. Similarly, 
Pdotip', Z3) = 1, because to induce p we must select a neighbor of 1, and all the unlabeled vertices 
in Z3 are neighbors of 1. The other flag densities are PdotiP', ^5) = Pdotil^'^ ^2) = l, Pdotip, ^i) = 

Pdotip; Z4) = Pdotip; Zi) = Pdotip; -^4) = ^, and Pdotip; ^2) = Pdotip; Z3) = Pdotip; Z5) = 0. 




Figure 2: Graph W. 



To see how to compute flag densities in an unlabeled graph, consider W, the graph on 5 vertices 
depicted in Figure [2j It is easy to see that ddotip;W) and ddoti7};W) are the edge and non-edge 
densities of W respectively, and so ddotip;W) = jq and ddotip;W) = j^. The computation of 
ddoti^i; W) is a little more involved. As an example, we explain how to compute ddotiZs; W). Note 
that Z3 consists of two nonadjacent neighbors of the labeled vertex 1. Hence for every vertex v € 
y(M^), let Hy denote the number of nonadjacent pairs neighbors of v divided by the total number 
of pairs of vertices in y(l^) \ {v}. ddotiZ3;W) is then the average of «;„ over all vertices in W, 



which comes out to g. Computing the other flag densities gives daot{Zi;W) = ^, daot{Z2;W) = j^, 
ddot{Zi;W) = i, and ddot{Z^,W) = ^. 

We can also compute the joint flag densities of multiple flags. For instance, let us consider 
diiotip, P',W). In this case, we first randomly choose a vertex v to be the labeled vertex. We must 
then make an ordered choice of two vertices in 1/(VF) \ {v}, as we have two flags, each with one 
unlabeled vertex. If both of these vertices are neighbors of v, then we have induced two copies of the 
flag p (note that the adjacency of these two vertices is unimportant). Hence we obtain d^otiPiP] W) 
by averaging over all vertices v the ratio of the number of ordered pairs of neighbors of v to the 
number of ordered pairs of vertices in T^(V7) \ {v}. In this case, we have ddot{p, P', W) = j^. 

Suppose as before we have a type a of size k, a cj-flag F of size / > k, and an unlabeled graph G. 
To compute da{F; G), we averaged over all random partial labelings of G the probability of finding a 
flag isomorphic to F. A simple double-counting argument shows that we can do the averaging before 
the random labeling, which is the idea behind Razborov's averaging operator, as defined in Section 
2.2 of [12j. Let F\q denote the unlabeled underlying graph of F. We can compute da{F; G) by first 
computing d{F\Q; G), the probability that I randomly chosen vertices in G form an induced copy of 
F\q as a subgraph. Given this copy of F\q, we then randomly label k of the / vertices, and compute 
the probability that these k vertices are label-isomorphic to a. This amounts to multiplying d{F\Q; G) 
by a normalizing factor QaiF), that is, d^^F; G) = q(j{F)d{F\Q; G) = q^{F)p{F\Q; G). 

We can interpret the normalizing factor as qa{F) = d„[F]F\Q). From our previous example, we 
have qdot{p) = Qdotip) = QdotiZ^) = 1, QdotiZs) = qdot{Z2) = \ and qdot{ZA) = qdot{Zi) = |. Since 
QdotiZ^) = 1, it follows that ddotiZ^; G) = d{K^; G) is the triangle density of G. 

There are more relations involving d^ and po- than the one mentioned previously. We will now 
state, without proof, a basic fact about flag densities that can be proved easily by double counting. 

Fact 3.1 (Chain rule). If a is a type of size k, m>\ is an integer, and {Fj}^-)^ is a family of a -flags 
of sizes \Fi\ = li, and I >k + 'YllLiih — k) is an integer parameter, then 

1. For any a -flag F of order at least I, we have 

p^{Fi,...,Fm;F)= Y, Pa{Fi,...,Fm;F')p,{F'-F). 

2. For any admissible graph G of order at least I, we have 

d^{Fi,...,F^;G)= Y, d^{F^,...,Fm;H)d{H;G)= ^ P^Fi, ■ ■ ■ ,Fm; F)d^{F;G). 

HeTi F€T^ 

If we apply the chain rule for m = 1, we have the equation po-(-P'; F') = ^pn^jr-j Pa{F; F")pa{F"; F') 
For instance, this gives 

Pdotip; F) = pdot{p; Zi)pdot{Zi;F) + pdot{p] Z2)pdot{Z2] F) + pdotip; Z3)pdotiZ3; F)-\- 
Pdotip; Zi)pdot{ZA; F) + pdotip; Z5)pdot{Z->; F) 
= -pdot{Zi;F) +pdot{Z3;F) + -pdot{Zi;F) + pdotiZ5;F). 



Similarly, we can expand pdotip; F) = \pdot{Zi]F) +pdot{Z2]F) + ^^PdotiZi^F). 

For the ease of notation, we can express these two identities using the syntax of flag algebras: 

p=iZi+Z3 + ^Z4 + Z5, (1) 

T) = —Z\ + Zo -\ Za. 

^ 2 2 

In this syntax, the equation '}2i^i oiiFi = means that for all sufficiently large a-flags F, we have 
^jgj aiP(j(Fj; F) = 0, where Oj € M for all i £ I. We call ^jg/ cti-Fi an eventually zero expression. 
We use A'^ to denote the set of linear combinations of flags of type a. It is convenient to define a 
product of flags in the following way: 

Fi • F2 =^ Y. pAFuF2;F)F, Fi G F%F2 G -F'^,/ > |Fi| + IF2I - \a\. 

(Note that it does not matter what I we choose, as the difference will be an eventually zero expression.) 
For example, instead of writing p^otip, P', F) = pdotiZ^'-, F) + p^otiZ^'jF), we could simply write p^ = 
p-p = Z^ + Z^. For the flags of our running example, involving K^-fiee graphs, the following equations 
are also easily verifiable: p^ = Z3 + Z5, Jp = Z2, and p •'p = \Zi^ + \Z\. Combining these equations, 
we arrive at the following equation, which we shall later require in Section [5) 

4p2 . f = 4Z2 • (Z3 + Z5) = (Z4 + Z^f. (2) 

To further simplify the notation, we can extend the definitions oipa and da to J^ by making them 
linear in each coordinate. For example, P(j(Fi + 2F2,4F3; F4 — F5) = 4po-(-F'i, F3; F4) — 4po.(-Pi,-F3; -^5) + 
%Pa{F2-,F-i\ F4) — %Pf,{F2-,F-i\ F5). The product notation simplifies these extended definitions, because 
Pa{fi-f2;f) =Pa{fiJ2;f) and d^ifi ■ 12] o) = da{fi,f2;g), for any /i,/2,/ G A'' and for any g G A°. 

The last piece of notation we introduce is that of the averaging operator. Recall that for any 
cr-flag F, we had the normalizing factors qa{F) such that da-{F; G) = qa-{F)p{F\Q; G). In the syntax 

clef 

of flag algebra, this averaging operation is denoted by [[-F]]o- = (Zcr-^lo • We can extend this linearly 
to all elements of A'^ . For example 

[[p]]dot = K2, [[Z5]Uot = K3, and [[Z4 + Z2]]dot = \P2 + ^^, 

where P2 is a path of length two on three vertices, and P2 is its complement. This notation is useful, 
because da{f;g) = p{[[f]]a',g) for any / G A'^ and for any g G .4*^, and hence we have a unified 
notation for both types of flag densities. 

3.2 Extremal problems in the flag algebra calculus 

Recall that the typical problem is to minimize the density of some fixed graph J amongst all admissible 
graphs G not containing a forbidden subgraph. We will show how flag algebras can be applied to 
this problem to reduce it to a semi-definite programming (SDP) problem, which can then be solved 
numerically. 



We may use the chain rule to obtam, for any t > \J\, the equation d{J; G) = YlueT ^('^i H)d{H\ G). 
Since Y1,h&t d{H;G) = 1, we have 

diJ;G) > mm d(J;H), 
HeTt 

which is a bound that clearly does not depend on G. 

This inequality is often very weak, since it only uses very local considerations about the subgraphs 
H £ J^t, and does not take into account how the subgraphs fit together in the larger graph G; that is, 
how they intersect. For instance, returning to our example of the (3, 3)-problem, where J = Kj, and 
t = 3, we obtain d{K3;G) > mmn ej^-^^ d^K^; H) = d{K3;P2) = 0, which is the most trivial bound. 
However, by considering how the graphs in J-3 must intersect in G, one might hope to find inequalities 
of the form ^f^^jr and^H; G) > 0, such that when we combine them with the initial identity, we get 

d{J; G) > d{J; G) - V aHd{H; G) = V (d( J; H) - aH)d{H; G) > min {d{J; H) - an}- 

H&Tt HeTt 

Since an can be negative for some graphs H, the hope is that this will improve the low coefficients 
by transferring weight from high coefficients. In order to find such inequalities, we need another 
property of the flag densities. 

Fact 3.2. If a is a type of size k, m>l is an integer, {Fj}^^ is a family of a -flags of sizes \Fi\ = li, 
and I > k + ^^^^^(^i — k) is an integer, then for any flag F of order n > I, we have 



p,{Fi,...,F^-F) 



\{p.{Fi;F) 



.i=l 



+ 0{l/n) 



One can prove Fact 13.21 by noting that, if we drop the requirement that the sets Xi are disjoint 
in the definition of Pa{Fi, . . . , Fm'-,F), the events Ei will become independent, and thus P(n^^-E'j) = 
ni^i ^{^i) = ni^i PfriPi'jP)- The error introduced is the probability that these sets Xi will intersect 
in F, which is 0{l/n). It is tempting to claim a similar product formula for the unlabeled fiag 
densities d^, but we cannot do so. In the above equation, it is essential that all the cr-flags Fi share 
the same labeled type a, and hence we require F to be a u-flag. 

We are now ready to establish some inequalities. Let's first fix a type a of size k. If Q is any 
positive semi-definite {Fj^l x |jy| matrix with rows and columns indexed by the same set Ff , where 
I > k, define 

Q{^n "= E Qfuf.fi ■ F2 G A'. 

Since Q was chosen to be positive semi-definite, we have 

Pa{Q{Fry, F)= Y. QfuF.pAFi; F)p^{F2; F) > 

for any cr-flags F of order at least t = 21 — k. When averaging, we do not necessarily have 
p([[Q{J^'^}]]o-; G) > for an admissible graph G of order n > t, but we do have the following in- 
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equality: 

def 



[[QMG) ='p{[[Q{Tn]UG)= Yl QF,,F,d^{Fi,Fr,G) 
= E QP^'P^ Y. PAFi,F2;F)d^{F;G) 
= E E QF,,F,P.{Fi.F2-,F)\d„{F-G) 



= Y. \ Y. QF,,F,Pa{Fv,F)p,{F2; F) d,{F- G) + 0(l/n) > o„^oo(l). 

Therefore, when n is large, we have that [[Q]]o-(G) is asymptotically non-negative. For each 
admissible graph H of size exactly t, let an = ^Q\\a{H) = ^^^ f-2<^T" QFi,F2daiFi, F2; H). We then 
have 

[[QMG) = Yl O^Hd{H-G) > On^ooil). 
H&Tt 

The expression in the middle of the above equation is called the expansion of [[QJlo-lC) in graphs 
of size i, with an the coefficients of the expansion. For the sake of conciseness, we often omit the 
parameter G and express this asymptotic inequality (combined with the expansion in size t) in the 
syntax of flag algebras 



\m, = i[<3{^)ii.. 



E 



QFi,F2Fi ■ F2 
Fi,F2&Tf 



Y auH > 0. (3) 



HeFt 



(Note that all inequalities between flags stated in the language of flag algebras are asymptotic.) 

For a concrete example, we return to the (3, 3)-problem. If we use the type a = dot, flags of size 
I = 2, expand in graphs of size i = 3, and consider 




where the rows and columns are indexed by p and p (in that order), we obtain (5{-^2°*} — ^{P~ P)^ = 
I (— Zi — Z4 + Z2 + Z3 + Z5) . This expansion is obtained by substituting the expressions for p"^ , p"^ and 
p ■ p that are given above Equation El Averaging gives [[Q]]a = f [[(p - p)'^]]dot = jK^ - \P2 - \P2- 
Recall that K3 + P2 + P2 = 1, since we are only considering i^3-free graphs. Therefore d{K^;G) > 
min/fgj-g {d{K-i;H) — [[QJlaiH)} = j, which is the correct bound for the (3, 3)-problem. 

In general, if we have more than one inequality available, we can combine them together, provided 
they are all expanded in the same size t. Suppose we have r inequalities given by the positive semi- 
definite matrices Qi of the Uj-flags of size k. Adding them together, we obtain 

r 

11 



where 

. / \ 

and we want to maximize min/^gj-^ {d{J; H) — an}- 

Thus we have transformed the original problem of finding a maximum lower bound for d{J] G) 
into a linear system involving the variables {Qi)Fk,Fr As we have the constraint that the matrices Qi 
should be positive semi-definite, this is a semi-definite programming problem. To take the minimum 
coefficient in the expansion, we introduce an artificial variable y, and require it to be bounded above 
by all the coefficients. Hence we have the following SDP problem in the variables y and {Qi)Fi,F2'- 

Maximize y, subject to the constraints: 

. SH = d{J;H) - YJi=i (y:p^^F^^^^.mF„F,d^AFi,F2;H)] - y > for ah F € Tf (The 
variables sh are called surplus variables.) 

• Qi is positive semi-definite for i G [r]. (The matrices Qi are often called the block variables 
of the SDP problem. We can assume without loss of generality that each Qi is symmetric, as 
otherwise we could replace Qi by {Qi + QJ)/2.) 

A computer can solve this SDP problem numerically, allowing for an efficient determination of the 
inequalities required to prove the extremal problem. For some practical remarks on the implemen- 
tation of flag algebras, please see Appendix [Al We note at this point, as shall be seen in Section [H 
that the solution to the SDP problem need not only give the asymptotic bound, but can also provide 
some structural information about the extremal graphs. 

4 The (4, 3)-problem 

In this section we will apply the flag algebra calculus to solve the (4, 3)-problem. Recall in the (4, 3)- 
problem we are interested in finding the minimum number of 4-cliques in a graph with independence 
number less than 3. We prove that any graph on n vertices with independence number at most 2 
must contain at least ^(4) + 0{n^) 4-cliques. This bound is attained by a balanced blow-up of C5, 
which Nikiforov conjectured to be optimal in jlOj . 

The first subsection contains our fiag algebra results, which leads to the asymptotic minimum 
density of 4-cliques. In the second subsection we use the structural information from the fiag algebras 
to derive a stability result. This allows us to determine the value of /(n, 4, 3) exactly for large n, and 
we show that a nearly-balanced blow-up of C5 is the unique extremal graph. 

4.1 The asymptotic result 

We begin by listing the admissible graphs of size 5, the types used in the proof, and the corresponding 
flags. Note that the flags of size 3 and type dot in Figure [6] are those we used as examples in Section 
EH Figured! 
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Figure 3: Graphs of size 5 with independence number at most 2. 
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Figure 4: Type t\ and its flags of size 4. 
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Figure 5: Type T2 and its flags of size 4. 
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Zi Z2 Z^ Z4 Z5 

Figure 6: Type dot and its flags of size 3. 

For each of the types used in the proof, we express the corresponding positive semi-deflnite matrices 
as a sum of squares. In the lemmas that follow, we give these sums of squares, their expansions into 
the admissible graphs of size 5, and provide sketches of combinatorial proofs (note that the lemmas 
were initially obtained by solving the corresponding SDP problem). We begin with the type ti. 
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Lemma 4.1. 



Ai 



(M2 + M4- Ml- Ms 



^•(2G. + 3G3 



G. -G.- 4Go - 2G 



10 



5Gii) > 0. 



Sketch of proof. Let G = {V,E) be a graph on n vertices. Define ti(G) = {{x,y,z) G y(G)^ : 
{x, y}, {x, z} £ E{G) and {y, z} £^(G)}. Every triple (x, t/, ^;) G ti(G) induces a copy of of the type 
Ti in G, where vertex x is labelled "1", vertex y is labelled "2" and vertex z is labelled "3". Fix some 
p = (x, y, z) G Ti(G). Note that M2 and M4 are flags where the unlabeled vertex is adjacent to 2 but 
not 3, while M\ and M3 are flags with the unlabeled vertex adjacent to 3 but not 2. Hence we define 



dp{v) 



def 



1, if {v,y}eE{G) hut {v,z} ^E{G), 
-1, if {v, z} G E{G) but {v, y} ^(G), 
0, otherwise, 



for each v G V{G) \ {x,y,z}. If we denote by F the flag induced by the labelled vertices {x,y,z} 
together with the unlabelled vertex v, we have 



dp{v) 



1, ifF = M2orF 
-1, ifF = MiorF 
0, otherwise. 



M4, 
M3, 



Thus the combinatorial interpretation of the lemma is 



Ai(G) 



3!l 



E 



/n—3\ 



p=(x,y,z)eTi{G) 



X! dp{v)dp\ 



wj 



v,w^{x,y,z} 



v^w 



120© 



E 



y^ dp{v)dp{w) > o„^oo(l)- 



p=(x,y,z)S:Ti{G) v,w^{x,y,z} 



The proof that this summation is asymptotically non-negative is very simple, since 

2 



E dpiv)dpiw) 



E ^p(^) 



E ^pW'' 



v,w^{x,y,z} 



yv^{x,y,z} 



v^{x,y,z} 



and 



120 



E 

p=(a;,j/,z)gTi{G) \vi^{x,y,z} 



E ^p(^) 



0(l/n). 



It remains to expand the products of the flags into admissible graphs of size 5, and thus show that 
Ai = ^ • (2G2 + 3G3 — G5 — Gs — 4G9 — 2Gio — 5Gii). For the sake of conciseness, we omit the full 
details of this calculation. We show how to compute the coefficient of Gio, that is, Ai(Gio); the other 
coefficients follow similarly. 



14 



In this case, the set {x, y, z, v, w} spans a copy of Giq. 

x^ — ^ xcf >py xcf >z y 









zh 6 yh 6 zi» 4 yJs 6 xb ^z xb 6y 

-4-4 

Figure 7: Possible configurations of p inside Gio and corresponding contributions to Ai(Gio). 

We have the following cases: 

1. Vertex x is one of the vertices of degree 3. There are two choices of x satisfying this condition. 
We have the following subcases: 

(a) Vertex y is the vertex of degree 2 of the triangle containing x and z is only neighbor of x 
which is not adjacent to y. This configuration corresponds to the first graph in Figure [71 
As one of the unlabeled vertices is adjacent to y and not z, and the other is adjacent to 
z and not y, both assignments of v and w, we have dp{v)dp{w) = —1. As there are two 
choices for the pair (f , w) and two choices for x, the total contribution for this configuration 
is -4. 

(b) The same configuration as above, but with the roles of y and z swapped. This configuration 
corresponds to the second graph in Figure [7] and its contribution is —4. 

(c) Vertex y is the other vertex of degree 3 and z is the only neighbor of x which is not adjacent 
to y. This configuration corresponds to the third graph in Figure[71 For any possible choice 
of V and w, we have dp{v) ■ dp{w) = 0, hence the total contribution is 0. 

(d) The same configuration as above, but with the roles of y and z swapped. This configuration 
corresponds to the fourth graph in Figure [7] and its contribution is 0. 

2. Vertex x is one of the vertices of degree 2 not in the triangle. Again we have two choices of x 
satisfying this condition. We also have the following subcases: 

(a) Vertex y is the only neighbor of x of degree 3 and z is the other neighbor. This configuration 
corresponds to the fifth graph in Figure [71 For any possible choice of v and w, we have 
dp{v) ■ dp{w) = 0, hence the total contribution for this configuration is 0. 

(b) The same configuration as above, but with the roles of y and z swapped. This configuration 
corresponds to the last graph in Figure [7| and its contribution is 0. 



When we sum the contributions we get —8, and hence the coefficient of Gio is Ai(Gio) = — jfo ~ ~TE 
We now consider the type T2- 



120 15' 

D 
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Lemma 4.2. 



Ao 



1 

10 



(-3A^i - 3N2 - 3A^3 - 3N4 + 2N5 + 2Nq + 2Nj + IN^f 

-24Gi - I2G2 - 24G3 - 8G5 + 28G6 + 9G7 + 9G8+ 
I8G9 + 9Gio - I2G12 + I6G13 + 4OG14) > 0. 



Sketch of proof. Let G = {V,E) be a graph on n vertices. Define r2(G) = {(x,y,z) G ^(G)^ : 
{a;,2/}, {x,2;}, {y,2;} G E{G)}. Every triple {x,y,z) G T2(G) induces a copy of of the type T2 in G, 
where vertex x is labelled "1", vertex y is labelled "2" and vertex z is labelled "3". Fix p = (x, y, z) G 
r2(G). Note that the flags Ni for 1 < i < 4 are those where the unlabeled vertex has at most one 
neighbour in the triangle T2, while in the flags Ni for 5 < i < 8, the unlabeled vertex has at least two 
neighbours in T2- This motivates the definition 



dp{v\ 



def J —3, if V is connected to at most one vertex in {x, y, z}, 
I 2, otherwise, 



for each v G V{G) \ {x, y, z}. The combinatorial interpretation of the lemma is 



A2(G) 



5! (5) 



E 



p={x,y,z)£T2{G) I v,w^{x,y,z} 



E dp{v)dp{w] 



> 0„^oo(l)- 



As in Lemma 14. H this is easily seen to be asymptotically positive. We omit the computation of 
A2(Gj) for i = 1, 2, . . . , 14, which can be performed as in the proof of the previous lemma. □ 

Finally we consider the dot type. Note that in this case the positive semi-definite matrix takes 
the form of a sum of three squares. 



Lemma 4.3. 

A3 = 



{Zi - 2Z2f + 4 • (6^2 - 7Z3 + 8Z4 - QZ^f + ^ • (2Z2 + 3Z3 - 2Z^f 
lb 8(J 



dot 



— (204Gi - II8G2 + 54G3 + 6OG4 - I7G5 + 42G6 - I44G7 
150 



94G8 



2G9 - 64Gio + 160G 



ai 



258G 



12 



281Gi3 + 42OG14) > 



Proof. We omit the proof, noting that the calculations involved are very similar to those in the 
previous lemmas. □ 

We are now in a position to combine the lemmas to obtain a bound on the minimum density of 
4-cliques in admissible graphs. In what follows, K^ represents the clique on four vertices, while G4 
denotes a cycle on four vertices. 
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Theorem 4.4. 

r^oA 2^ 1^ 3 1^ 2^ 24^ 19 ^ 

7^4 - 2Ai A2 A3 = \ G5 -\ Gio H G12 -\ Gi3 

25 5 25 30 75 75 150 

3 1 2 ^ 4 ^ 1 

= 25 + 30^^ +15^^ +15^^^ +10^^^- 
Proof. We first expand the graphs K4 and C4 into admissible graphs of size 5. A straightforward 
calculation gives K4 = ^(Gi + G3 + G4 + 2^6 + 5G14), and C4 = -5(^10 + 2Gi2 + G13). Note that the 
density of graphs on k vertices is measured with respect to (Tj , and so the normalization factor of 4 
appears when expanding graphs on four vertices to graphs on five vertices. Now we use Lemmas 14. 1^ 
14.21 and 14.31 to expand Ai, A2 and A3 into the graphs Gj. Noting that X^j C; = 1, we can replace 
X ^j Gi with ^, which results in the above theorem. D 

We conclude this section by using the above theorem to deduce some structural information about 
extremal graphs. Recall that t4{G) denotes the number of 4-cliques in G, while for any graph H, 
tniG) counts the number of induced copies of H in G. 

Corollary 4.5. Suppose G is a graph on n vertices with ti{G) = (^ + o(l)) (2). Then 

(i)tG,{G) = o{n^), 

(a) tciiG) = oin'^), and 

(Hi) all but o{n) vertices of G have degree (| + o(l))n. 

Proof. Applying Theorem 14.41 to G, we have 

d{K,;G)-2Ai{G)-^A2{G)-^As{G) = ^ + ^d{G,;G) + ^diC,;G) + ^d{Gu;G) + ^d{Gry,G). 
In particular, using the asymptotic non-negativity of Aj(G), we have 

d{K,; G)>^ + ^d{G,; G) + ^diC^; G) + ^A3(G) + o(l). 
25 30 15 5 

Thus if d{K^; G) = ^ + o(l), we must have ^(Gs; G) = ^(Q; G) = AsiG) = o(l). This immediately 
gives tG^{G) = o{n^) and tc^{G) = o{n^), and so it remains to justify (iii). We have 



A3(G) 



(Zi - 2Z2f + -^(6^2 - 7^3 + 8Z4 - 6^5)2 + ^(2Z2 + 3Z3 - 2Z5)2 
lb oU 



= o(l). 

dot 



For every vertex v, let F^ be the dot-Hag obtained from G by labeling the vertex v with 1. By 
definition of the averaging operator, A3(G) is the average over vertices v of the corresponding fiag 
densities in Fy. The expression is a sum of squares, and thus will be asymptotically non-negative. 
Since the average is o(l), the expression must be o(l) for all but o{n) vertices. In particular, for these 
vertices we have 

Pdot{Zi; Fy) - 2pdot{Z2; Fa) = 0(1), 
Qpdot{Z2; Fy) - YpdotiZs; Fy) + 8pdot{Zi; Fy) - GpdotiZ^; Fy) = o(l), and 
2pdot{Z2; Fy) + 3pdot{Z3; Fy) - 2pdot{Z^; Fy) = o(l). 
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Since the sum of the flag densities must be 1, we also have 

Pdot{Zi;F^) + Pdot{Z2] F.a) + PdotiZs; Fy) + PdotiZr, F^) +Pdot{Z5;F^) = 1. 

Finally, recall from Equation ^ in Section [3] that 4^2 • {Z^ + Z^) — (Z4 + Zi)^ = 0. Applying this 
to Fy, we have 

^Pdot{Z2]Fy) {pdot{Z3; Fy) +Pdot{z->;Fy)) - {pdot{ZA;Fy) + pdotiZr, Fy)f = 0(1). 

This gives us a system of five equations in the five variables pdot{Zi', Fy). The first four equations 
form a linear system of full rank, which we can use to express all the variables in terms oipdot{Z^]Fy). 
Substituting these terms into the fifth equation gives a quadratic equation mpdot{Z^; Fy), which results 
in two solutions, namely {pdotiZf, Fy))^^-^ = (^, ^, ^, ^, ^) + o(l) or (i, i, 0, 0, i) + o(l). 

We now show that the second solution implies a large number of 4-cliques. Indeed, suppose 
V gV was a vertex with {pdot{Zi; Fy))^^^-^ = (i, i, 0, 0, |) + o(l). Recall from Equation ([T|) in Section 
3 we have p = ^Zi + Z3 + ^^4 + Z5, where p is the dot-fiag of size 2 corresponding to an edge. 
Applying this to the fiag Fy, we deduce that the degree of u is (^ • ^ + 3 + o(l)) n = ^n + o{n). 
Thus there are 2^ + o{n) vertices v is not adjacent to, and since G is K^-fiee, these vertices must 
form a clique. This clique contains ^2" "'•"-'^ r^ 16(4) 4-cliques. Consider now the neighborhood 
of V. Since pdotiZ-y, Fy) = o(l), it follows that the neighborhood is missing at most o{n?) edges. 
Hence the number of 4-cliques in the neighborhood of v is (2" !' ) — o{n'^) ~ ^(4)- Thus we have 
t4{G) > (I + 0(1)) (2), which contradicts our assumption that t4{G) = (^ -|- o(l)) (2). 

Hence for almost all vertices v, we have {pdot{Zi; Fy))^^^ = (^, ^, ^, ^, ^) + o(l). Applying 
Equation ([T|), we deduce that the degree ofuis (^•^ + ^ + ^'^ + ^+ o(l)) n = (|n -|- o(l)) n, 
as claimed. 

D 

4.2 The stability analysis 

We will now use the results of the preceding section to show that, for sufficiently large n, a blow-up 
of C5 is the unique extremal graph for the (4, 3)-problem. Recall that in a blow-up, we replace every 
vertex with a clique, and every edge with a complete bipartite graph. Hence a blow-up of C5 consists 
of five disjoint sets of vertices Vi, with Vi U V^+i a clique for all 1 < i < 5, and no edges between Vi 
and Vi+2 for all 1 < i < 5 (throughout this section, indices will be taken modulo 5). 

Suppose G is a i^3-free graph on n vertices with the minimal number of 4-cliques. Our proof 
consists of three steps. We first use the results of Corollary 14.51 to deduce that G is close to being 
a blow-up of C5 (note that this holds not only for an extremal graph, but for any family of graphs 
that is asymptotically optimal). In the second step we use the minimality of G to show that G must 
in fact be a blow-up of C5 with asymptotically equal parts. Finally, we solve an integer optimization 
problem to determine the size of the parts of G exactly. 



Recall that from Corollary 14.51 we have that if n is sufficiently large, and G is an extremal graph 
on n vertices, then t4{G) = ^(4) + o(n^), tc^iG) = o{n^), tc^iG) = o{n^), and all but o{n) vertices 
of G have degree |n -|- o{n). From this we shall deduce that G is almost a blow-up of C5. To this end, 
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we introduce some definitions. Given subsets A,B C V{G), we say A is an almost clique if all but 
0(71"^) pairs in A are adjacent, and we say {A, B) is almost com,plete {alm,ost em,pty) if all but o{v?) 
pairs in Ax B are adjacent (nonadjacent). Finally, we define a triple {a, b, c} G V{G) to be typical if: 

(i) {a, b} ^ E{G), c G N{a) n N{b), d{a) , d{b) , d{c) = f n + o(n), 

(ii) {a, b} is contained in o(n^) copies of C4, 

(iii) {a,b,c} is contained in o(n) copies of C4, and 

(iv) {a,b,c} is contained in o(n^) copies of G5. 

Note that G[{a, b, c}] is an induced path of length 2. As all but o{n) vertices are of degree |n+o(n), 
it is easy to see that there are O(n^) induced paths of length 2 in G. As Corollary 14.51 asserts that 
tCiiG) = o{n'^) and tc^iG) = o{n^), it follows that almost all induced paths of length 2 are typical. 
We will now use the neighborhoods of {a, b, c} to define the parts corresponding to the blow-up of 
C5. In particular, we define 

Vi =N{a)n N{b) , V2 = {a} U (N{a) n mb) n N{c)) ,¥3 = N{a) n iV(6) n iVXcJ, 
V4 = iV(a) n N{b) n iVXcJ, and V5 = {b} U ('N{a) n N{b) n iV(c 



We now make some preliminary observations about the sets Vi. Clearly, by definition, the sets are 
disjoint. Moreover, since a{G) < 2, and {a,b} ^ E{G), we must have N{a) U N{b) = V{G) \ {a,b}, 
and so UjT^ = V{G). Similarly, for any vertex v G V{G), N{v) must induce a clique, as any non-edge 
in N{v) forms an independent set of size three with v. Thus V2 U V3, V3 U V4, and V4 U V5 are (actual) 
cliques. Finally, note that if u, f G Vi are such that {u,v} ^ E{G), then the set {a,b,u,v} induces a 
copy of C4. Since {a,b,c} was chosen to be a typical triple, properties (ii) and (iii) imply that Vi is 
an almost clique, and c is adjacent to all but o{n) vertices in Vi. 

We can also obtain some relations regarding the sizes of these parts. By property (i) of typical 
triples, we have d{a) , d{b) , d{c) = |n + o{n). Since N{a) U N{b) = V{G) \ {a,b}, we have \Vi\ = 
\N{a)nN{b)\ = \N{a)\ + \N{b)\-\N{a)UN{b)\ = ln + o{n). Moreover, as A^(a) U {a} = liuy2Uy3, 
N{b) U {b} = Vi U V4 U V5, N{c) \ Vi = V3 U V4, and c has o(n) non-neighbors in Vi, we deduce 

2 2 2 

\V2\ + IV3I = 7n + o(n), 11/3! + IV4! = -n + o{n), and IV4I + IV5I = -n + o{n), 
5 5 5 

which also imply IV2! + IV5I = |n + o{n). 

We are beginning to uncover the approximate Cs-blow-up structure of G. Recall that we have 
shown that V2 U V3, V3 U V4 and V4 U V5 are cliques, while Vi is an almost clique. We will establish 
the relations between the remaining parts by showing: 

• {Vi, Vi^2) is almost empty for any 1 < i < 5, and 

• (Fi,V2) and (VijV^) are almost complete. 
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We start by showing that (Vi, V3) is almost empty. For any u G VinA^(c) and v G V3, if {u, v} E E{G), 
then the set {a,b,c,u,v} induces a copy of G5. As {a,b,c} is a typical triple, property (iv) implies 
that there are at most o{n'^) copies of G5 containing {a,b,c}, and so there are at most o{n?) edges 
between Vi n N(c) and V3. Since c is adjacent to all but o{n) vertices in Vi, this shows that (Vi, V3) 
is almost empty. By the symmetry between a and b (and hence V3 and V4), it follows that (Vi, V4) is 
also almost empty. 

Now consider the vertices in Vi. By Corollary 14.51 all but o{n) of these vertices have degree 
|n + o{n). Since (Vi, V3 U V4) is almost empty, it follows that all but o{n) vertices in Vi have o{n) 
edges to V3 U V4. Hence, since \Vi\ + IV2I + IV5I = |n + o(re), it follows that Vi is almost complete to 
Vi U V2 U V5. In particular, (Vi, V2) and (Vi, V5) are almost complete. 

Next consider the vertices in V2. We have established that {V2, Vi U V2 U V3) is almost complete. 
Once again, using the restriction on the degrees, and the fact that |Vi| + IV2I + IV3I = |n + o{n), 
we deduce that (V2,V4) and {V2,V^) are almost empty. Symmetry implies {V^,V2) and {V^,V^) are 
almost empty as well, as claimed. 

At this point we have determined the global structure of G, in which each part Vi corresponds 
approximately to the blow-up of a vertex in C5 . We now wish to show that G is an exact blow-up of 
C5, with parts of size ^n -|- o{n). 

In order to do so, we shall require greater control over the adjacency of individual vertices, and 
not just the parts Vi. With this in mind, for each 1 < i < 5, we define a vertex v ^ Vi to he bad if 
V has J7(n) non-neighbors in Vi-i U Vi U T^+i or Q(n) neighbors in Vi+2 U 1^+3. Since for each i we 
have that Vi U V^+i is an almost clique and (V^, Vi+2) is almost empty, it follows that there are o{n) 
bad vertices. We clean up the partition of V{G) by removing bad vertices from each Vi and placing 
them in a set U. This results in a partition V{G) = Vi U . . . U V5 U C/ satisfying: 

(1) for any 1 < i < 5 and vertex v G Vi, v is adjacent to all but o(n) vertices in Vi-i UViU V^+i, 
and V is not adjacent to all but o{n) vertices in Vi+2 U Vi+3, and 

(2) ^2 U V3, V3 U V4, V4 U V5 are cliques, and 

(3) 1^1 1 = ln + o{n), 11/2 uy3|,|F3UT4|, 1^4 UFsl = |n + o(n), and \U\ = o{n). 

The following proposition asserts that in an asymptotically optimal graph, the above conditions 
imply that the almost cliques are, in fact, true cliques, and that the parts are asymptotically equal. 
This will in turn allow us to completely determine the structure of extremal graphs. 

Proposition 4.6. // Vi, 1^, . . . , V5 satisfy (1),(2) and (3), then for any 1 < i < 5, Vi U Vi+i is a 
clique, and \Vi\ = ^n + o{n). 

Proof. We already know from (2) that many of the pairs of neighboring parts are cliques. It remains 
to show that Vi U V2 and V5 U Vi are both cliques. We first show that Vi is a clique. Suppose 
for contradiction that there are nonadjacent vertices u,v G Vi. Since a{G) < 2, we must have 
V3 U V4 C N{u) U N{v). By (3) we have IV3 U V4I = ^n + o{n), and so either u or u must have at least 
-n + o{n) neighbors in V3 U V4. However, this contradicts (1). Thus Vi is a clique. 
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We now claim that if {Vi, V2) is not complete, we must have IV4I = o{n). Indeed, suppose u & Vi 
and V (z V2 are not adjacent. Since a{G) < 2, we must have V4 C N{u) U N{v). By (1), both u and 
V have o{n) neighbors in V4, which implies IV4I = o{n). By symmetry, if (Vi, V5) is not complete, we 
must have IV3I = o{n). 

Suppose now that one of these sets, say V4, is of size o{n). Using (3), we must have IV3I = IV5I = 
|n + o(n), and IV2J = o{n). Since IV3I 7^ o{n), it follows that (Vi,V5) is complete. Thus G has two 
large disjoint cliques: V3 of size |n + o{n), and Vi U V5 of size |n + o{n). This gives 

''<°)K'"r'">(«"r'"Vii(:)-<"')>i(: 

contradicting the asymptotic optimality of G. Hence (^1,^2) and (VijV^) must be complete, which 
implies that Vi U V2 and Vi U V5 are cliques. 

Finally, we show that all parts have size ^n + o{n). Recall we already have \Vi\ = ^n + o{n). Since 
IV3I + IV4I = ^n + o{n), we may by symmetry assume IV3I > ^n + o{n). Corollary 14.51 implies there is 
some vertex of V3 whose degree is |n + o{n). By (1), this implies IV2I + IV3I + IV4I = |n + o{n). As 
l^sl + 1^1 = l'^ + o{n), this implies IV2I = |n + o(n). Combined with the equations in (3), this gives 
\Vi\ = |n + o{n) for all 2 < i < 5. □ 

We now turn our attention to the set U of bad vertices. In particular, we will show that in an 
extremal graph, each u ^U can be reintroduced into some part V^ in a way that is consistent with (1) 
and Proposition 14.61 Since \U\ = o{n), we can repeat this process without affecting (1) or Proposition 
and thus we can eliminate the set U . 



Proposition 4.7. For every u €z U , there is some i = i{u) such that Vi^i U T^ U Vi+i C N{u), and u 
has o{n) neighbors in Vi^2 U Fj+s. 

Proof. Fix u (z U. We begin with a simple claim. For any 1 < j < 5, if there is some v £ Vj such that 
u is not adjacent to v, then u is adjacent to all but o{n) vertices in Vj+2 U V^+s- Indeed, as a{G) < 2, 
we must have Vj+2 U V^+s C N{u) U N{v). However, v is adjacent to o{n) vertices in Vj+2 U V^+3, and 
so the claim follows. 

Now suppose there is no i such that Vi_i U Vi U Vi+i C N{u). This implies there is an i such that 
u is not adjacent to some vertices in both Vi-3 and T^_i. Applying the previous claim, it follows that 
u is adjacent to all but o{n) vertices in Vi-i UViU V^+i U Vi+2- 

In this case, remove all edges between u and K+2) and add any missing edges between u and 
Vi-i UViU Vi+i U U. It is easy to see that we still have a{G) < 2. As u had ^n + o{n) neighbors in 
Vi+2, which is a clique, we have removed at least (s" °^"'^) = U{n^) 4-cliques. On the other hand, we 
have only added o{n) edges, and so created o{n'^) new 4-cliques. Thus we have reduced the number 
of 4-cliques, which contradicts the extremality of G. 

Thus there must be some i = i{u) such that Vi-i U Vj U V^+i C N{u). It remains to show that u 
has o{n) neighbors in Vi+2 U l^i+s- Suppose for contradiction that u has Q{n) neighbors in Vi+2 U ^i+3. 
As Vi+2 U Vi+3 is a clique, these neighbors form Q{n^) 4-cliques with u. Instead, we could remove 
all edges between u and Vi+2 U Vi+3. To prevent the formation of an independent set of size 3, we 
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add all edges between u and U. This introduces o{n) new edges, and thus o{n^) new 4-cliques, while 
maintaining a{G) < 2. Thus the number of 4-cliques is reduced, again contradicting the minimality 
of G. This completes the proof. 

D 

Given any u G [/, we can apply Proposition 14.71 to add u to Vn^y^-j . Repeat this process until U 
is empty. In this case we have a partition V{G) = Vi U . . . U V5 such that for every 1 < i < 5, 
\Vi\ = ^n + o{n) and Vi U V^+i is a clique. 

In order to conclude that G is a blow-up of C5, it remains to show that there are no edges 
between Vi-i and l^+i for any i. Suppose to the contrary there is an edge between some v € Vi-i 
and w S Vi+i- Note that when n is large, we must have \Vi\ = -^n + o{n) > 2. For any x,y ^ Vi, 
{v,w,x,y} is a 4-clique. Thus removing the edge {v,w} reduces the number of 4-cliques without 
increasing the independence number. Hence in an extremal graph, there are no edges between Vi-i 
and Fj+i for any i, and thus G is indeed a blow-up of C5 with parts of size |n -|- o{n). 

We now seek to determine the sizes of the sets Vi exactly. Noting that Vi U V^+i is a clique for 
each i, it is easily verified that 

MC)^Er'T^'')-E(":- 

Define yi = \V2i-1 U V2i\ for all 1 < i < 5. In ^yj, each vertex is counted twice, so we have 
YlVi = 2n. Moreover, as \Vi\ = ^n + o(n), we have yi = ^n + o{n). Finally, as n — yi — yj+i = 
n — |V2i-i| — |V2j| — IV2J+1I — |V2i+2| = !V2i-2|, we can rewrite the above expression as 



wo = E ft) -E (""%"'•")• 



Thus to find the extremal graph, we must minimize the above expression over integer values of yi 
subject to the conditions given earlier. The solution is given by Lemma 14.81 which we prove in 
Appendix [Bj 

Lemma 4.8. Let e > 6e sufficiently small, and n sufficiently large. Consider the function 

I \ S^ fy^\ sr^ [n-yi- yi+i\ 

9{yi,y2,y3,y4,y5) = 2^{^] - 2^{ . I- 

Subject to the constraints that the yi be integers satisfying J2i-iyi = 2n and |yi — |n| < en, g is 
uniquely (up to cyclic permutation of the variables) minimized when the yi take values \^\ and |"^] 
in ascending order. 

From Lemma 14.81 we see the minimum occurs when yi = [" "'^^~ 1 for 1 < i < 5. Solving for jV^j, 
we have that the unique extremal graph on n vertices is the blow-up of C5 to n vertices such that: 

• when n = 5k, \Vi\ = k for all i. 



22 



• when n = 5/c + 1, |Vi| = IV2I 

• when n = 5/c + 2, |Vi| = IV2I 

• when n = 5k + 3, \Vi\ = IV2I 

• when n = 5/c + 4, |Vi| = IV2I 



/c, iFsl = IV5I = k + l, and |T4l = k-1, 
\V4\ = k, and [Vsl = [V^l = k + 1, 
IV4I = k + 1, and IV3I = IV5I = k, and 
k + l, \Vs\ = \V5\ = k, and IV4I = k + 2. 



5 The (3, 4)-probleni 

In this section we solve the (3, 4)-probleni, and prove that Erdos' conjecture holds for this case. 
Recall that this entails showing that amongst all graphs of independence number less than four, T„^3, 
a disjoint union of three nearly-equal cliques, minimizes the number of triangles. 

In the first subsection we list our flag algebra results, which give the asymptotic minimum number 
of triangles to be 5(3). In the second subsection we use the structural information obtained to 
determine the value of /(n, 3, 4) exactly. We also analyze the structure of extremal graphs, and show 
they must contain r„^3. 

5.1 Getting the asymptotic result and densities 

We begin by presenting the 29 admissible - that is, Kj^-iiee - graphs of size 5, followed by the three 
types and associated flags used in the proof. 
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Figure 8: Graphs of size 5 with independence number at most 3. 
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03 




T2 



2« e 
lo 03 

Ni 



2« 
1 



2« 
-03 1 




iV2 

2(;> 



1 



29 



2 

03 1 



2® ® 
-03 lo 03 



Na 



N7 



63 16 A3 



Figure 10: Type T2 and its flags of size 4. 
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Figure 11: Type dot and its flags. 
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In the subsequent lemmas, for each type used in the proof, we express the corresponding positive 
semi-definite matrices as squares of flags, and give their expansions into graphs of size 5. The coeffi- 
cients were obtained through the use of a computer program, but can easily be verified by hand, just 
as in the previous section. 

Lemma 5.1. For the type ti, we have 

Ai = [[(Ml - M2)%, 

' (Go-G.-AGa), 



30 

A2 = [[(3Mi - 3M2 - IOM3 + 10M4)% 



30 



(9G2 - 9G3 - 36G6 - 6OG9 + I6OG11 + IOOG13 + 6OG15 - 6OG16 - IOOG25 - 500G 



26) 



Lemma 5.2. For the type T2, we have 

A3 =[[(-3iVi -N2 + 3Ns + SN^fjU 

=^( - I8G2 + 9G8 + 3G9 - llGio + 3Gi2 + 27Gi4 - I8G16 + 9G20 + 36G24) 
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A4 =[[{-20Ni - 2ON2 + llA^s + 11A^4 + 9N5 + me)%, 

=i-( _ 440G2 - 36OG3 + 400G8 + I2IG9 - 480Gio - 360Gii - 319Gi2 + 198Gi3 + 363Gi4 

OU 

- 279Gi5 - 242Gi6 + I2IG20 + 279G23 + 484(^24 + I98G25 + 405^26), 



A5 =[[(-19iVi - 15iV2 + 15iV3 + 15iV4 + 4iV5 + ANq + 15iV- 



7)%, 



--—{ - 57OG2 - I52G3 - 57OG4 + 361G8 + I8IG9 - 675Gio - 120Gii - 735Gi2 + 240Gi3 

OU 

+ 675Gi4 - 136Gi5 - 450Gi6 - 1350Gi8 + 900Gi9 + 795G2o + 675G2i + I36G23 + 9OOG24 
+ I2OG25 + 8OG26 + 45OG28), 



As =[[(-6iVi - 14iV2 - 2N3 - 2N4 + 8N5 + SNe - 5N7 + 10iV8)^]] 



Tl 



=— ( + 24G2 - 96G3 + 6OG4 - 240G6 + 36G8 - 76G9 + 3O8G10 - 264Gii + I6OG12 - II2G13 

OU 

+ 12Gi4 - 32Gi5 - 8G16 - 54OG17 + 42OG18 + 40Gi9 - 56G20 + 75G2i + 32G23 + I6G24 
+ 88G25 + 32OG26 - 8OG27 - I5OG28). 

Lemma 5.3. For the type dot, we have 

A7 =[[{-2Zi + Z2f]]dot 

=4 (eCi + 2G2 + 2G3 - 8G5 + 4G6 - 10G8 - 4G9 - 2Gii - Gi5 + Gie + 6G22 + 2G23 
15 

+ G25 + 5G26), 

As =[[{-2Zi - Z2 + 4Zsf]]dot 

=— ( - 42Gi - 2G2 + IOG3 + 24G4 + 24G5 + 36G6 + 48G7 - 2G8 - 4G9 + 2Gii - 4Gi3 
15 

— Gi5 — 7Gi6 — I8G22 — 6G23 + G25 + 5G26) , 

A9 =[[(7Zi - 4^2 + Z3 + 3Zif]Uot 

=— (l38Gi + 6IG2 + 43G3 - 39G4 - I4IG5 - 45G6 + 3G7 - 146G8 - I9G9 + 42Gio 
15 

- 52Gii + 2IG12 - 22Gi3 + 9Gi4 - 25Gi5 + 65Gi6 + 54Gi7 + 54Gi8 + I8G19 - 6G20 
+ 72G22 - 2IG23 - 96G24 + I9G25 + 125G26 + I8G27) , 

Aio =[[(8Zi - 2Z2 - 9^3 + lOZ5f]]dot 

=— ( - I68G1 + IO3G2 + 85G3 + 17OG4 + I53G5 + 226G6 + 3G7 - 16G8 + 4G9 + I6OG10 
15 

- I6G11 + I2OG12 - 132Gi3 - I2OG14 + I6G15 + 8OG16 + 24OG18 + 7OG19 - I2OG20 
+ 6OOG21 - I38G22 - 136G23 - 26OG24 - 86G25 + 2OG26 + 2OOG28 + I5OOG29), 
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All 



dot 



=— (Gi + G2- 2G3 + 4G4 - 2G5 - 2Ge + IOG7 - 5G8 - 2G9 + 4Gio - 2Gii + 7Gi2 

yu 

+ 4Gi3 + 13Gi4 - 5Gi5 + G16 + Gi7 + ISGig + 19Gi9 + IOG20 + 28G21 - 5G22 - 2G23 
+ 4G24 + G25 - 5G26 + 7G27 + I6G28 + 4OG29) ■ 

We can now combine these lemmas to obtain an asymptotic lower bound on the density of triangles, 
K^, in any K4-free graph. 



Theorem 5.4. We have 



where 



11 ^29 , 



«=1 



9 1 



c = (c,)Jii = —J (263984,4720,4432, H^, I?Z^, l^^^l^^, 1185, 8437, 3440,856, 1128 ) . 

V *7»_i 25 . 3 . 1009 \^ ' ' ' 371 ' 112 ' 3392 ' ' ' ' ' y 

Proof. We begin by expanding K^ into graphs of size 5. A straightforward calculation gives 
1 



i^3 = Y^ ( Gi + G2 + 2G4 + 4G7 + Gio + 2Gi2 + 2Gi3 + 4Gi4 + Gie + 3Gi8 + 5Gi9 
+ 3G20 + 7G21 + G22 + G23 + 2G24 + G25 + 2G27 + 4G28 + lOG 
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We now use the lemmas to expand the squares Aj into the graphs Gj . After summing the coefficients 
in the linear combination, it can easily be verified that they are all at least g. Since the densities 
must sum to 1, we have Yli=i ^j ~ 1' which gives the final equality. □ 



Corollary 5.5. Any n-vertex graph G with a{G) < 3 satisfies 



tsiG) 



47 



I) 4036n ^ 

3/ V 



E 



djv) 
n-\ 3 



-I > - -O„^oo(l) 



Proof. Since the Aj are squares of fiags, they are asymptotically non-negative. E 
terms for A,, 1 < i < 10, maintains the inequality. This gives K^ — -^^ (^p — ^y 
Interpreting these terms combinatorially gives the corollary. 



ence discarding the 

_ 9~On— >•oo(-Lj• 



dot 



5.2 The stability analysis 

In order to derive a stability result for the (3, 4)-problem, we use the following well-known result of 
Andrasfai, Erdos and Sos [2]. 

Theorem 5.6. (Andrasfai, Erdos, Sos) A K^-free graph on n vertices that has minimum degree larger 



than 



3r~7 
3r-4 



n must he (r — l)-partite. 



26 



Applying this to the complement of a graph with r = 4, we find that a graph G on n vertices 
with a{G) < 3 and maximum degree less than |n must be spanned by three cliques. The following 
stability result follows. 

Proposition 5.7. Suppose < e < ^q. There exists no = no(e) such that any graph G on n > 
no vertices with a{G) < 3 and ts{G) < (^ +e^) (3) contains an induced subgraph G' C G on at 
least (1 — lOOe^) n vertices that is spanned by three cliques of size between (^ — 3e) n and (| + e) n. 
Moreover, every vertex in G' sends at most 4en edges outside its clique. 

Proof. We have from Corollary 15.51 that for any graph G on n vertices with a{G) < 3, 

t^iG) 47 ^fd{v) 1\' 1 



In particular, if t^{G) < (^ + e^) (3), and n is large enough, then 



2 



Let B = {v: d{v) >{^ + e)n}. Then jSje^ < ^^ f g ~ s)' < 100^^"' and so \B\ < WOe^n. 
Let G' be the induced subgraph on V{G) \ B. As claimed, G' has n' > (1 — 100e^)n vertices. 
Moreover, since e < -^ the maximum degree A(G') is bounded by 



A(G') <(- + s)n< ^^\ n' < -n' . 
^ ^ \3 J - 1-I00e3 8 



Hence we can apply Theorem 15.61 in its complementary form to deduce that G' is spanned by three 
cliques. 

Since A(G') < (| + e) n, we deduce that the largest clique in G' has size at most (| + e) n. This 
implies that the smallest clique has size at least (l — lOOe^) n — 2 (^ + e) n > (3 — 3e) n (using the 
bound e < 3q). This implies that every vertex in G' can send at most (g+e^n— (3— 3e") n = 4en 
edges outside its own clique. 

Finally, consider the vertices in i?. If any vertex v €z B is adjacent to all vertices in one of the 
cliques Gj, and does not have more than Aen edges outside Gj, then we can add v to Gj without 
affecting any of the previous bounds. Thus the only vertices left in B are either those adjacent to 
one clique, but with too many neighbors outside the clique, or those with a non-neighbor in each of 
the three cliques. □ 

This stability result allows us to, for large values of n, deduce the exact value of the (3, 4)-problem, 
and also to characterise all extremal graphs. Recall that we define f{n,k,l) to be the minimum of 
tk{G) over all graphs G on n vertices with a{G) < I — 1. 

Theorem 5.8. There exists uq such that for every n > uq, /(n,3,4) = ('-"'^^-') + (l-("+gi)/3J) + 
( 3 )• Moreover, if G is a graph on n > no vertices with t3(G) = /(n,3,4), then G contains 
Tn^3, a disjoint union of three nearly-equal cliques. 
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Proof. First note that G = Tn^s has a(G) < 3 and so we have the upper bound /(n, 3, 4) < t^iTn^s) = 
(K3J) + (L{n+^i)/3j) + (L(r^+2)/3j^ ~ ^ (3) " note that this upper bound holds for ah n. 

To obtain a matching lower bound, we apply the stability result from Proposition 15.71 Take 
e = Y^, and let n > no(e) be sufficiently large. Suppose G is an extremal graph on n > no vertices. 
In particular, we have ^3(6*) < (g + e^) (3) for n large enough. From the proof of the proposition, 
we know that there is a set B of at most lOOe'^n 'bad' vertices, and the remainder of the vertices 
are in three cliques, with at most 4en edges to the other cliques. Label the cliques in order of size, 
say \Ci\ > IC2I > IC3I. We will show that an extremal graph cannot have any bad vertices, so G is 
spanned by the three cliques. We begin with a simple observation. 

Claim: Every vertex v G V{G) is in at most (' "* 2 ) triangles. 

Proof: Suppose some vertex v were in more triangles. Delete v, and add a new vertex v' with 
N{v') = Gs U B. This does not increase the independence number, and v' is in at most (' ^2 ) 
triangles. Hence we have decreased the number of triangles in G, which contradicts the minimality 
of G. Note that (l^^'+l^l) < (1^^') + \B\n < (l^^l) + lOOe^n^ = (l^^l) + e'^n^. 

Now consider a potential bad vertex v a B. There are two reasons v could be bad: 

Case 1: v is adjacent to all vertices of one of the cliques Cj, but has more than 4en neighbors in 
the other cliques. 

If V has more than 4ere neighbors in the other cliques, it must have at least 2en neighbors in one 
of them. Note that every pair of these neighbors creates a triangle with v. Thus v is in at least 
( 2 ) ~'~ ( 2"^) ^ ( 2^ ) "^ s'^n'^ triangles, which contradicts our earlier claim. Hence this case cannot 
occur. 

Case 2: v has a non-neighbor in each of the three cliques. 

Let di = \Gi\ N(v)\ be the number of non-neighbors of v in the ith clique. Consider the cliques 
in increasing order of these values, that is, suppose di^ < di^ < di^ . Let x be a non-neighbor of v in 
Gil ■ 

Case 2a: Every vertex y £ Gi^ is adjacent to one of {v,x}. 

Since x has at most 4en neighbors in Gi^, it follows that di^ < di^ < 4en. Counting only the 
neighbors of v in the cliques Cji and C^^ , we see that t; is in at least (|ChI-4'^") + (|C'2I-4£") > 2(|C3|-4£n^ 
triangles. We have 2{^^^^-^'''') ^ {G^l'^ - SejCsIn + lee^n^. Since IC3I > (5 - 3e) n and e = j^, this 
is greater than (' 2^ ) + e^n^ < ^|C3p -|- e^n^, which contradicts the earlier claim. 

Case 2b: v and x have a common non-neighbor in Cjj, say y. 

In this case, as a{G) < 3, every vertex in Cjg must be adjacent to one of {v,x,y}. Since x and 
y have at most 4en neighbors in Cjg, it follows that di^ < di^ < di^ < Sen. Thus v is in at least 
^\C,,\-8en^ ^ ^\c.,\-8en^ ^ ^\c.,\-8en^ > 3(1^^1^^^") « ^{G^l^ - 24e|C3 |n + QGe^n^ triangles. Again, 

given our bounds on IC3I and e, this is greater than (' 2^* ) + e^n^, which gives a contradiction. 

Thus we have shown that in an extremal graph, there are no bad vertices, and so the three cliques 
span all n vertices and \B\ = 0. Now note that any vertex in Ci is in (' 2 ) triangles from within 
Ci alone. By the earlier claim, we must have (' 2 ) — ( ^2 ) ~ ( 2^)' ^om which it follows that 
|Ci| — 1 < IC3I. Thus IC3I < IC2I < \Gi\ < IC3I -|- 1, which shows that the cliques must be nearly 
equal in size. 
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This implies that T„^3 C G, and so it follows that for any graph G on n vertices with a{G) < 3, 
we must have ^3(6') > t3(r„^3). Thus /(n,3,4) = t3(r„^3). Moreover, if G is an extremal graph, then 
since we have equality, there can be no triangles with vertices from different cliques. This means 
that each vertex can have at most one neighbor in each of the two other cliques; in other words, the 
bipartite graphs between cliques are (partial) matchings. These matchings must be such that there 
is no triangle with one vertex from each clique. However, the extremal graph is not unique, as there 
are many possibilities for the matchings. 

D 

6 Concluding Remarks 

In this paper, we apply the techniques of flag algebras, combined with stability arguments, to solve 
the Erdos problem for the cases {k,l) = (4,3) and (3,4). In particular, we show that Nikiforov's 
construction of a blow-up of C5 is optimal for the (4, 3)-problem, while Erdos' conjecture still holds 
for the (3, 4)-problem. 

We have also run the SDP problem for larger cases, and our calculations suggests that Erdos' 
conjecture remains valid for the (3, 5)- and (3, 6)-problems. Moreover, it would appear that a blow- 
up of C5 is also optimal for the (5, 3)-problem. Since this paper is already quite long, we decided not 
to process the SDP results to find rational solutions. However, after doing so it should be possible 
to develop stability results similar to those above, and thus to determine the exact solution to these 
problems. 

Note that the extremal graphs we have found are all blow-ups of small graphs. In particular, the 
graphs are Ramsey graphs. The construction of / — 1 cliques is a blow-up of an independent set of size 
/ — 1, which is the R{2,1) Ramsey graph. On the other hand, C5 is the i?(3,3) Ramsey graph. One 
may therefore ask if, for large n, the solution of the {k, Z)-problem is always a blow-up of an R{s, t) 
Ramsey graph, where s and t depend only on k and I. Solving this problem in general appears to be 
quite difficult. 

A simpler question, first asked by Nikiforov, is to determine the extremal graphs for the {k, l)- 
problem as one parameter is fixed and the other grows. In particular, it remains to determine for 
which values of / a disjoint union of Z — 1 cliques remains optimal for the (3, /)-problem. In light of 
the above results, one could also study for which values of k the blow-up of C5 is optimal for the 
(fc, 3)-problem. Proofs by flag algebras are infeasible for large values of k and /, as the search space 
and running time grow exponentially in these parameters. It would be of great interest to develop 
new techniques to attack this problem. 
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A Implementation of flag algebras 

In Section [3l we covered the basics of the theory behind flag algebras; here we discuss the actual 
implementation of the method. In particular, we will discuss how to set up the SDP problem, and 
then find a verifiable proof. The main steps are: 

1. Identifying the types ctj to use, and finding a suitable size t for the expansion of the positive 
semi-definite matrices. 

2. Finding a verifiable (e.g. rational) solution that leads to a proof. 

3. (Optional) Writing the positive semi-definite matrix as a sum of squares. 

We shall address each of these steps in turn. 

Identifying types: 

The process of identifying the necessary types Uj and finding a suitable size t essentially comes 
down to trial-and-error. Note that whatever choice of types and size we make will result in an SDP 
problem as outlined above, which can then be solved to provide some bound for the extremal problem. 
In order to determine whether or not this is the right bound, we need a conjecture on what the bound 
should be - this typically comes from a construction. We then seek to keep improving the flag algebra 
results until they match the conjectured bound. 

To produce the flag algebra results, we start with the initial size t to be the size of the subgraph 
J, the density of which we are trying to bound. Given t, we produce a list of all admissible graphs 
G of size t. We then consider all possible types of size suitable for expansion into graphs of size t. 
Recall that if we have a type of size k, and use flags of size / > A; + 1, then to compute a product of 
two flags, we must expand into graphs of size at least 21 — k > k + 2. This restricts the size of types 
and flags we can use - our types can be of size at most t — 2, and given a type of size k, we choose 
the largest possible size of flags / that satisfies 21 — k <t. 

For each of our types Uj, with its associated list of flags Tp , we compute the product of each pair 
of flags, which gives the corresponding block in the SDP problem. This provides the formulation of 
the SDP problem, which can then be solved numerically. 

If the numerical bound is less than the conjecture, then we do not have enough types to solve the 
problem. Thus we increase the size t, which allows the use of larger types, and repeat the process. If 
the numerical bound matches the conjecture, we then have enough types to solve the problem, and 
can proceed to finding a verifiable proof. 

At this stage, we have the block variable matrices Qi for the SDP problem. However, as they 
were computed numerically, they are subject to rounding error, and thus we cannot be certain that 
they are truly positive semi-definite matrices, nor that the bound for the extremal problem they 
provide is exactly equal to the conjectured bound. To have a rigorous proof, it is necessary to find 
solution matrices Qi whose entries are known exactly - they will ideally be rational. It can then be 
independently verified that these matrices satisfy the conditions necessary to prove the desired result. 
We now outline some of the steps that can be taken to find such a solution. 



31 



Finding a verifiable solution: 

Typically, the space of solutions will be a high-dimensional space, with many degrees of freedom 
for the entries of the matrices Qi. To try to force the solution towards rational entries, we seek to 
reduce the dimension of the search space. There are three methods we can apply: reducing the size of 
the block variables, identifying natural eigenvectors, and changing the basis to introduce zero-entries. 

Recall that for each type Uj we have the associated block variable Qi. In identifying which types 
to use, we added all possible types until we obtained the right bound. However, it is possible, and 
even likely, that some of the types are unnecessary. Given a type a, we remove it from the SDP 
problem, and run the SDP solver again. If we still obtain the correct bound, then we know the type 
a was unnecessary. If instead this results in a worse bound, then we keep o", and try removing a 
different type. In this way we arrive at a minimal set of necessary types, thus reducing the number 
of block variables in the SDP problem. 

Given a set of minimal types, there is a further reduction possible. Every type a has the natural 
group Po- of automorphisms of the underlying graph (Tq. The group Po- acts on the algebra A^ by 
relabeling the flags according to the automorphism. We can then decompose A" = AZ^ ® A"L into 

a positive and negative part, where AZ^ consists of all elements invariant under P^, while AZ. = 
■^ / G A'^ : X^^gr^ 7/ = f • For example, given the type and flags of Pigure 3, both labelings of the 
vertices of a give rise to automorphisms, and so T„ is the symmetric group on two elements. One can 
verify that F3 e AZ, Fi + F2 £ Al, and Fi - F2 £ A'L. 




!• •2 

Id o2 lo b2 Id b2 

a 

Fi F2 F3 

Pigure 12: Decomposition into positive and negative parts. 

This decomposition is useful because whenever we have / € A"^ and g € A'L, we have [[/-^llcr = 0. 
Hence given the semi-definite matrix Q for the type a, we can split it into its 'invariant' part Q"^ 
and 'anti-invariant' part Q~ . While this increases the number of block variables, they are now of 
smaller size, and hence have fewer degrees of freedom, reducing the dimension of the search space. 
Moreover, it may be that not all of these parts are necessary, so we can proceed as before to remove 
any unnecessary block variables. 

The second technique we use is that of identifying natural eigenvectors. Por this, we require an 
extremal construction that attains the conjectured bound; let G„ represent an extremal graph on n 
vertices, and let {Gn}n&- Given a type a, fix a position of a in G„. This turns G„ into a cr-flag Fn- 
The family {FnjneN represents a way to consistently label the type a in Gn- 

Recall that in the flag algebra calculations, we used the bound [[Qllo-CGn) > o„^oo(l)- If Gn is 
an extremal graph, then the bounds are tight, and so [[<3]]ct(G„) = o„^.oo(l)- Hence we must have 
Pa(Q{-77};^n) = J2Fi,F2€Tf Qft„F2Pu{Fi; Fn)pa{F2] Fn) + 0(l/n) = o„^oo(l)- Taking the limit as 
n — )■ 00, this implies that if we have a vector v defined by vp = \\Ta.n-^ooPu{F;Fn) for F € J^, 
then vp must be a zero-eigenvector of Q. Repeating this for different embeddings of the type a in 
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the extremal family of graphs {Gn} can give rise to several eigenvectors. This procedure is formally 
defined using the apparatus of ensembles of random homomorphisms in Section 3.2 of [12]. 

Having fixed this eigenvectors, we can then reduce the size of the block variables. Note that if 
we are able to remove all zero-eigenvectors this way, then we are left with positive definite matrices 
as our block variables. This leaves a little room for error, so we can replace the entries with simple 
rational entries and hope to still have a positive semi-definite matrix. 

Our final method for reducing the dimension of the search space is to change the basis to introduce 
zero entries. Ideally the new set of variables will be a rational linear combination of the previous set, 
which will lead to a solution with rational entries. Moreover, we introduce zeros in such a way as to 
split the block variables into smaller blocks. More formally, consider the general SDP problem of the 
following form: 

maximize tr{CX), subject to 

• ti{AiX) = Oj for i = 1,2, ... ,ni 

• X ^0 (that is, X is positive semi-definite) 

where X and Ai are symmetric n x n matrices for i = 1,2, . . . ,m. 

Suppose we had a rational n x n matrix M such that all entries of the first row (and hence 
column, by symmetry) of MXM'^ , except possibly the first, were zero. We can then change variables 
to modify the SDP problem into an equivalent one, as below: 

maximize tr(Cy), subject to 

• ti{AiY) = Oi for i = 1,2, . . . ,m 

• Y to 

where C = {M-^YCM-^ and At = {M-^Y AiM-^ for i = 1, 2, . . . , m. 

The solutions of both problems are related by the equation Y = MXM'^ . We can now reduce 
the dimension of the solution space by forcing all the non-principle entries of the first row/column of 
C and Ai to be zero for i = 1,2,. . . ,m. This is possible because we already have the existence of a 
solution Y with Yij = Yj^i = for j = 2,3, ... ,n, and hence this restricted solution space contains 
a solution to the original problem. This operation splits the block variable Y into a one-dimensional 
block and an (n — l)-dimensional block. We can now iterate the procedure. 

We find such a matrix M by inspecting the numerical solution to the original SDP problem, and 
using a rational approximation to an eigenvector v for the first row. We then fill in the remaining 
rows with independent vectors orthogonal to v. Note that if the solution is initially positive definite, 
there is a little room for error, so we may hope to choose a simple rational approximation without 
worsening the solution to the SDP problem. 

Expressing the solution as a sum of squares: 

If we are able to repeatedly iterate the change of basis procedure outlined above, then we will 
eventually reach a problem whose solution is a diagonal matrix. This is advantageous for two reasons. 
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First, the semi-definite programming problem reduces to a linear programming (LP) problem. This 
can be solved by only taking rational linear combinations of the entries of the variables at every 
step, and so the solution will be a rational combinations of the input to the LP problem. Hence the 
solution can be specified exactly, resulting in a verifiable proof. Second, we can write the positive 
semi-definite matrix as a sum of squares, which is easier to understand. This can lead to combinatorial 
interpretations of the proof, as we demonstrated in Section 14.11 Thus while this step is not necessary 
for solving problems with the machinery of fiag algebras, it makes the resulting proofs much more 
understandable. 

B Integer optimization problem 

In this appendix, we prove Lemma 14.81 from Section 14.21 in which we solve the integer optimization 
problem required to determine the size of the parts in the blow-up of C5 that minimizes the number 
of 4-cliques. 

Lemma B.l. Let e > be sufficiently small, and n sufficiently large. Consider the function 



9{yi,y2,y3,y4.,y5) = ^ 



i=l 



sr^ n-yi - yi+i 



Subject to the constraints that the yi be integers satisfying X]i=i 2/* = 2n and \yi — |n| < en, g is 
uniquely (up to cyclic permutation of the variables) minimized when the yi take values \_-^\ and \^'\ 
in ascending order. 

Proof. First we will show that if (yi,y2;?/3;?/4;?/5) is optimal, the yi should be as equal as possible. 
Suppose towards contradiction that this was not the case. Then there are i,j with yi — yj > 2; let 
i,j be such that this difference is maximal over all such pairs. There are two cases: 

Case 1: i and j are consecutive. 

Without loss of generality, suppose i = 2 and j = 3, so we have 2/2 — 2/3 > 2, with this difference 
being maximal. We will show that g{yi,y2 - 1,2/3 + 1,2/4,2/5) < 5(2/1,2/2,2/3,2/4,2/5), which contradicts 
our assumption of optimality. Indeed, we have 



^g = 5(2/1,2/2 - 1,2/3 + 1,2/4,2/5) -5(2/1,2/2,2/3,2/4,2/5 

'2/2 - 1\ , fys + 1\ /"- - 2/1 - 2/2 + 1 



.iH'r 



+ 
2/2 



n-2/1 
4 



- A fn - y-i - 

3 / V 3 



2/2 
2/4 



+ 



"--2/3 
4 



f^ - 2/3 - 2/4 
4 

2/4' ^ 



n-2/1 
3 



2/2 



Now let s = 2/2-2/3-I > 1, and let t = {n-y3-y4-l)-{n-yi-y2) = 2/1-2/4+2/2-2/3-1 = 2/1-2/4+s. 
li t < 0, then clearly the above expression is negative, which shows (2/1,2/2,2/3,2/4,2/5) is not optimal. 
Hence we must have t > 1. In this case, we can rewrite the above as 



As 



J) + (*)(n-!„-!«) + «("-''^ 



2/2 



+ 



s\ , /2/3 
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Prom our constraints on the variables yi, we have that 1/3 = (| + 0(e))n, n — yi — 1/2 = 
(I + 0(e)) n, s < 2en and t < Aen. These bounds imply that the main terms are those linear 
in s and t. We have 

A<7 = 4 [(1 + 0{e)) t - (4 + 0{e)) s] r? + ©((s^ + t^)n). 
5U 

In particular, for large n, this can only be non-negative if t > (4 — 0(e)) s. However, we have 
t = yi — y4 + s, and by our assumption of maximality of y2 — yz-, we have yi — 2/4 < 2/2 — ys = s + 1. 
Hence t < 2s + 1, and we have a contradiction. 

Case 2: i and j are not consecutive. 

Without loss of generality, suppose i = 2 and j = 4, with y2 — yA > 2 being the maximal difference. 
Let 

^9 = g{yi,y2 -1,2/3,2/4 + 1,2/5) -5(2/1,2/2,2/3,2/4,2/5)- 

By similar calculations to those in Case 1, we have 

A = (^*\ - (^^ ~ 1 V /^" ~ 2/5 - 2/4 - 1\ /« - 2/4 - 2/3 - 1\ _ /"- - 2/2 - 2/3\ _ /« - 2/1 - 2/2 

We define s = 2/2 — 2/4 — 1, a-iid t = (n — 2/5 — 2/4 — 1) — (?^ — 2/i — 2/2) = 2/i — 2/5 + ■5- If i < 0, then Ag < 0, 
which contradicts the optimality of (2/1,2/2,2/3,2/4,2/5)- Hence we may assume t > 1, and rewrite Ag 
in terms of s and t as before. In this case we find 

A<7 = 4 [(1 + Oie)) t - (3 + 0(e)) s] n^ + ©((s^ + t2)n). 
5U 

Hence for Ag > 0, we must have t > (3 — 0(e)) s. However, by maximality of 2/2 — 2/4, we have 
t = 2/1 — 2/5 + s < 2s + 1. The only way these equations can be satisfied is if s = 1 and 2/1 — 2/5 = 2. 
But in this case 2/1 and 2/5 are two consecutive variables with a maximal difference, and so we reduce 
to Case 1, which leads to a contradiction. 

Hence we have shown that subject to the above conditions, g is only minimised when the variables 
2/i take values |_^J or j"^] . If n = 0,1,4 (mod 5), there is only one way (up to cyclic rotation) 
that these values can be distributed, so the minimum is uniquely determined. If n = 2,3 (mod 5), 
then there are two possible distributions of the values. In each case, an easy calculation shows g is 
minimised when the values are in decreasing order. This completes the proof of the lemma. □ 

Note that we assume \yi — -rul < en only to simplify the proof. Even without this condition, 
we can prove that for any n > 12, the above result holds. However, as the flag algebra results are 
asymptotic in nature, we can only determine the unique extremal graph for the (4, 3)-problem when 
n is large. 
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